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Abstract 

We theoretically and numerically investigated the threshold network model 
with a generic weight function where there were a large number of nodes 
and a high threshold. Our analysis was based on extreme value theory, 
which gave us a theoretical understanding of the distribution of independent 
and identically distributed random variables within a sufficiently high range. 
Specifically, the distribution could be generally expressed by a generalized 
Pareto distribution, which enabled us to formulate the generic weight distri- 
bution function. By using the theorem, we obtained the exact expressions 
of degree distribution and clustering coefficient which behaved as universal 
power laws within certain ranges of degrees. We also compared the theoret- 
ical predictions with numerical results and found that they were extremely 
consistent. 
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1. Introduction 

Many researchers during the last decade have enthusiastically investigated 
the structure of real networks that have emerged from various disciplines, 
such as information science, sociology, and biology. As a result of these in- 
vestigations, some have found two features inherent in their networks, i.e., 
small-world [l[ and scale-free j^j, which were vastly different from those of 
the regular and random networks already known. The term "small-world" 
means that the clustering coefficient remains large and that there are very 
short paths between every two nodes, and "scale-free" means that the degree 
distribution has a long tail, which usually obeys a power law. All networks 
having both (or either) of these are often called complex networks. Many 
physicists have tried to theoretically understand their features by using math- 
ematical models. 

The threshold network model is a non-growing network model that ex- 
plains the properties of complex networks. Each node in this model has an 
intrinsic weight that is randomly distributed according to a certain probabil- 
ity distribution function. Every pair of nodes satisfying the condition that 
the sum of their weights is over a certain given threshold is linked. When the 
weights are generated from various types of restricted distribution (specifi- 
cally, exponential or power-law type distributions), both the degrees of nodes 
and the degree-wise clustering coefficient obey universal power laws at the 
high tails of the degrees p, 0, III- We investigated the scale-free feature for 
both the degrees and clustering coefficient in a wider variety of weight distri- 
bution functions using extreme value theory 0, 0, S]. Extreme value theory 
provides an explanation of the distribution of independent and identically 
distributed (iid) random variables within a sufficiently high range. Accord- 
ing to this theory, the distribution is universally expressed with a generalized 
Pareto distribution (GPD). This gives the expression of a generic weight dis- 
tribution function, which finally results in universal power laws, which is the 
main contribution of this paper. 

This paper is organized as follows. We briefly explain the threshold net- 
work model in Sec. [2j then give the formulas for the distributions of degrees 
and clustering coefficient. In Sec. EU we briefly review extreme value theory. 
We present the theoretical results of universal power laws in Sec. H] and com- 
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pare the theory with numerical results in Sec. We finally conclude with a 
summary and a discussion in Sec. El 



2. Threshold Network Model 

First, we will explain the threshold network model 0, 0, [|| with n(> 1) 
nodes. We assign a weight, Wi (1 < i < n), to each node by using an iid 
random variable from a certain weight distribution function, f(w). When 
the sum of two weights Wi, Wj (i ^ j) in every pair of nodes is more than or 
equal to a given threshold, 9, i.e., 

Wi + Wj>9. (1) 

an edge is formed between them. Going through this procedure, we can 
finally obtain a threshold network. 
Let 

/w 
f(w')dw' (2) 
■oo 

be a weight function. Here, we assumed that the threshold network had 
a large number of nodes, n. From the definition of the threshold network, 
nodes with the same weight values have an equal number of degrees, and the 
degrees of nodes monotonically increase as their weight values increase. Due 
to these features, the degree of the threshold model is specifically described 
in the continuous limit of n as 

/oo 
f(w')dw' = n{l- F(9-w)}, (3) 

where < k < n. This equality gives a one-to-one correspondence between 
the degree, k, and the weight, w. Using Eq. ([3]), the degree distribution, p(k) 
(0 < k < n), naturally follows 



dw 



/('-^(i-i)) 



p(k) = f(w)— = ^- / // ■ (4) 



d,k 



The degree-wise clustering coefficient, C(k), means that the clustering coef- 
ficient of nodes with degree k also leads to 
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C(k) 



k(k-l) 




) 



) 



(5) 



3. Extreme Value Theory 

Next, let us review the essence of extreme value theory. This theory 
mathematically treats the statistical behavior of the maximum 



where (1 < i < n) is an ensemble of iid random variables given by a 
common distribution function, F(x). In a linearly renormalized maximum, 
there exists the following limit distribution 



where z is defined as {z : l + — \i)j<J > 0}, — oo < ji < oo is a location pa- 
rameter, a > is a scale parameter, and — oo < £ < oo is a shape parameter. 
This is called a distribution of generalized extreme values (GEVs). Note that 
the GEV distribution includes three statistical distributions: Frechet (£ > 0), 
Gumbel (£ = 0), and (reversed, meaning x — > —x) Weibull (£ < 0). As listed 
in Table dj the maxima of various types of distribution function converge to 
a GEV distribution (for more details, see pp. 153-157 in Embrechts et al. 



It is well known in extreme value theory that the following theorem is 
satisfied using G(x). 

Theorem 1. Assume that the sample maxima of F(x) converge to G(x). 
Then, for sufficiently large u, a distribution function H of Y = X — u, 
conditional on X > u, yields 



M n = max{V 1; ■ • -,X n }, 



(6) 





B). 



H u {y) 



Pr{X <y + u\ X >u] 
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Tabic 1: Relation between types of distribution function F(x) and shape parameter £. 



Name 


Distribution Function F(x) 


Shape Parameter £ 


Beta 


I>+6) p /a-l/i Aft-lJ / 

r(o)r(6+i) Jo x v 1 x > ax ' 
< x < 1, a, 6 > 0. 


-1/6 


Uniform 


x, < x < 1. 


-1 


Exponential 


l-exp(-Ax), x > 0, A > 0. 





Normal 


— oo < x < oo, — oo < m < oo, s > 0. 





Lognormal 


/ a! s ^_exp{ (lux' M) 2 /2S 2 }dx'/x', 
x > 0, -oo < M < oo, 5 > 0. 





Pareto 


1 - rr _Q , rr > 1, a > 0. 


l/a 
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on {z : l + £y/<7>0}, where a = a + £(w — ji). 

The last equality in Eq. ([8]) is called a generalized Pareto distribution (GPD). 
For details of the proof, see pp. 76-77 in Coles 0] or pp. 10-11 in de Haan 
and Ferreira [8|. 

We rearrange Eq. with respect to F(x), where x = y + u. It then 
follows that 

F(x) = {l-F(u))H u {x-u) + F(u), 

- 1 - (1 - F(u)) (l + ifcpT) V \ ( 9 ) 

for x > u. Here, assume F(x) in Eq. (Q is a weight distribution function of 
the threshold network in Eq. (TS]). Then, Eq. represents a generic weight 
distribution function under the conditions where the number of nodes n and 
the lower bound, u, in Eq. ([9]) are sufficiently large. 



4. Main Results 

4-1. Degree Distribution 

To calculate the degree distribution in Eq. (j4j) with the generic weight 
function in Eq. ([9]), we first need to derive f(x) and F _1 (l — k/n). The 
former can easily be calculated by differentiating Eq. ([9]) : 



/Ws ^ w = iz|M( 1 + «^j , (io) 

for x > u. The latter can also be calculated by using Eqs. ([3]) and (Q. 
Rearranging Eq. ([3]) with respect to 9 — w, we have 

F-^l (11) 
Here, substituting Eq. ([9]) with x = 9 — w (> u) into Eq. ([3]), it follows that 

_ = i-F(e-w) 

n 

= (l-F(u))(l + ^(9-w-u)) \ (12) 



i- 1 
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where 9 — w > u. Rearranging the last equality in Eq. (1121) with respect to 
9 — w, and then substituting this resulting equation into the right-hand side 
of Eq. ((TTD, it follows that 



i-* 

n 



a 



k/n 



i \l-F(u) 



- 1 



u. 



(13) 



Therefore, by using Eqs. (fTUl) and (flBl . the degree distribution of the generic 
weight function is finally given by 



1 



p(k) = - 



?? 



2u) 4 



k/n 



a) \l - F{u) 



(14) 



As we can see from Eq. (fT4"|) . the degree distribution obeys a power law, 
p[k) ~ A; _ ( 1+ ^ when £ 7^ 0. For £ = 0, the distribution becomes 



p(fc) = d(k) exp{— c(k)}, 

nil- F(u)) 2 exp{-^} 



k 2 



(15) 



for £ — > 0, where 
c(*) 



1 



-2m)^ 



/c/n 



2 + (6-2u)± 



/c/n 



- 1 



To summarize, we analytically obtain the universal power law for the range 
of large degree, 



P (k) ~ *r< 1+ « (£^0), 

~ A;" 2 (£ = 0). 



(16) 
(17) 



Here, note that Eqs. fflQ|) and ffl2|) that we used to obtain the power law 
are satisfied for x > u and 9 — w > u, respectively. Therefore, the applicable 
range of weight in the power law is given by 



u < w < 9 — u. 



;is) 
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Equivalently, that of degrees is also obtained by using Eq. (j2D 
n{l - F(6 - «)} < fc < n{l - F(u)}. 



(19) 



These inequalities indicate the applicable range of degree-distribution scales 
with the number of nodes, n. To keep the inequalities in Eq. (fl8l) unbroken, 
threshold 9 must satisfy 9 > 2it, where w is sufficiently large. This indicates 
that we have to implicitly assume a high threshold to universally observe the 
power laws. 

4-2. Clustering Coefficient 

We next calculate the clustering coefficient in Eq. (jSj) with the generic 
weight function in Eq. This automatically means that the clustering 
coefficient is also restricted in the range of Eqs. (fl9l and ([15]) . resulting in 
a high threshold. First, we need to check whether the lower bound of the 
integral range in Eq. (jSJ) is within the range in Eq. (fl9|) . Using Eqs. (TTTT) and 
(USD, it follows that 



This means that the lower bound is inside the domain in Eq. (fl9|) . and 
therefore the integral can successfully be calculated without violating the 
requirements of extreme value theory. Combining Eqs. (Q and (fl3l) . it 
follows that 





> n{l-F(9-u)}, 




(20) 



where 




(21) 



Therefore, the clustering coefficient is given by 




(22) 
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for large k, where p(k) is the same as in Eq. (|14p . When £ = 0, Eq. (1201) with 
Eq. (121 p can be explicitly calculated as 



r ( v f-M l - - 



?? 



i (l-FH) 2 nexp{-^} 
k 



(23) 



When £ = 0, therefore, 



( Q-2u k 
x 1 + — - — + 2 log 



cr n(l-F(u)), 
~ A;" 2 . (24) 

To sum up, the clustering coefficient also obeys a universal power law ex- 
pressed by 

C(k) ~ k- 1 (e^O), (25) 
- ^ 2 (^ = 0). (26) 

Finally, we should mention C(fc) = 1 is always satisfied in the range of 
degree k < n{l — F(8/2)} or equivalently when the weight w < 9/2. Since 
u < 8/2 < 6 — u, it immediately follows that 9 > 2u. Therefore, 

n{l - F(6 - u)} < n{l - F(6/2)} < n{l - F(u)}. (27) 

These results mean that C(k) = 1 for n{l-F(6-u)} <k< n{l-F(6/2)}, 
and more importantly, the range of the clustering coefficients in Eqs. (1250 
and ff26|) is modified by 

n{\- F(9/2)} <k<n{\- F(u)}. (28) 

As well as the case of the degree distribution, the applicable range of the 
clustering coefficients is also scaled with the number of nodes, n. 
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5. Numerical results 



This section presents numerical results for the distributions of degrees 
and clustering coefficients to compare them with the theoretical results ob- 
tained in the previous section. To observe power laws, we must select the 
values of parameters n, 9, and u optimally with the next two items in mind. 
First, 9 and u are satisfied by the implicit condition, 9 > 2u, that prevents 
the applicable ranges in Eqs. (fT9j) and ff28]) from diminishing. Second, as 
the ranges increase as n and 9 — u increase, we need to give them as numer- 
ically large values as possible to identify the theoretical curves in Eqs. (JHJ), 
( I15p . (|22p . and (T21]) clearly in the corresponding ranges. We determine the 
three parameters n, 9, and u using the following procedures. From the shape 
of the weight distribution function, appropriate u and a are calculated us- 
ing the fExtremes package 0] in R language 10]. When the value of u is 



fixed, we give a large 9 while retaining the condition 9 > 2u and gradually 
increase n until the universal power laws can be observed widely as well. 
Typical results where the weights are given by the normal and the lognormal 
distributions (£ = 0), the standard Frechet distribution (£ > 0), and the 
uniform distribution (£ < 0) are given in Figs. [1] and [2j As can be seen 
from these figures, the theoretically predicted curves nicely fit the numerical 
results, meaning Eqs. (JHj), (fToT) . fl22J), and ([23]) are all accurate. Each 
line of power law k~ s with s — 0, 1, and 2 in the figure indicates the uni- 
versal power law in Eqs. (flBT) . (jT7|) . (|25|) . and (j26|) . These lines are also 
extremely consistent with the numerical results. The plots of the numerical 
results around the lower bounds of the ranges appear to deviate from being 
linear, i.e., the power laws, because the original formulas for the degree dis- 
tribution in Eq. (j!4p and the clustering coefficient in Eq. (I22p are only valid 
in large-degree regions. Therefore, all the deviations result from small-degree 
effects. 

Consequently, we evaluated the theoretical predictions in Sec. 0] and found 
they were in good agreement with the results obtained from numerical ex- 
periments in both the distributions of degrees and clustering coefficients. 



6. Summary and Discussion 

We analyzed the threshold network model based on extreme value theory. 
Where the number of nodes, n, and threshold 9 were sufficiently large, the 
weight distribution function, F(w), could be assumed to belong to a family 
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of the generic weight function in Eq. for x > u as long as the limit 
distribution of maximum values for F(w) converged to the distribution of 
GEVs in Eq. (jTJ). As previously mentioned, many important weight functions 
are classified into this family; therefore, we could comprehensively treat the 
model with various weight functions. We then calculated the exact forms of 
the distributions of degrees and clustering coefficients by applying the generic 
weight function in Eq. (JSJ). As a result, we found theoretical evidence for the 
universal power laws in Eqs. (fl9l) and (|28|) . which was extremely consistent 
with the results obtained by Masuda et al. |4(. We also demonstrated that 
even though the theoretical prediction assumed a sufficiently large number 
of nodes, n, threshold value, 9, and degree, k, the theoretical results could 
be reproduced through numerical experiments. This fact could explain that 
real complex networks with a large number of nodes and a high threshold (if 
they exist) could universally appear with scale-free properties. 

We conclude by referring to recent mathematical studies on threshold 



network models [11|, ll3|- We think that our investigation was important 
as a prototype to apply extreme value theory to various other models of 
complex networks. In future work, it would be interesting to reformulate 
our method mathematically to find the possibilities of extending its range of 
applications. 
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Figure 1: Numerical results for the degree distribution p(k). (a) Weights are given by the 
normal distribution with m = 0, s = 1 in Tablc[T](£ = 0) and n = 10 5 , 9 = 5, u = 1.65, and 
a = 0.47. (b) Lognormal distribution with M = 0, S = 1 in Table [1] = 0) and n = 10 5 , 
9 = 30, u = 5.19, and a — 2.55. (c) Standard Frechet distribution F(w) = exp(— 1/x) 
= 1) and n = 10 5 , 9 = 1000, u = 19.7, and a = 20.34. (d) Uniform distribution in 
Table [T] = -1) and n = 10 4 , 9 = 0.95, u = 0.05, and a = 0.95. The two vertical 
dotted lines in each figure indicate the lower and upper bounds of the applicable range in 
Eq. (Unj). The theoretical prediction in Eq. (Ti~4"|) or (TTS")) is indicated by the solid curves, 
and power laws k~ s with s = or 2 are indicated by the dashed lines. 
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Figure 2: Numerical results for the degree- wise clustering coefficient C(k). (a) Weights 
are given by normal distribution with m = 0, s = 1 in Table [T] (£ = 0) and n = 10 4 , 
= 4.9, u = f .64, and = 0.50. (b) Lognormal distribution with M = 0, S = 1 in Tabled] 
(£ = 0) and n = 10 4 , ^ = 20, u = 5.18, and cr = 2.69. (c) Standard Frechet distribution 
F(w) = exp(-l/x) (( = 1) and n = 10 4 , = 100, u = 19, and a = 20.20. (d) Uniform 
distribution in Tabled] {£ = -1) and n = 10 4 , 9 = 0.95, u = 0.05, and a = 0.95. The two 
vertical dotted lines in each figure indicate the lower and upper bounds of the applicable 
range in Eq. (f2"B"|) . The theoretical prediction in Eq. (f2"2"|) or is indicated by the solid 
curves, and the power laws k~ s with s = 1 or 2 are indicated by the dashed lines. 
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